[TIDY FIRST] Type `BaseRunner` class #10607

QMalcolm · 2024-08-26T19:20:18Z

Resolves #10606

Problem

The `BaseRunner` class is largely untyped. Given how widely this class is used, leaving it untyped could mean there are edge cases and/or runtime exceptions we are not aware of. By typing this class, we give ourselves a lot more guarantees.

Solution

Add typing. Of note, adding this typing uncovered a potential runtime exception in the on_skip method. We address this in 5adce38.

Checklist

I have read the contributing guide and understand what's expected of me.
I have run this code in development, and it appears to resolve the stated issue.
This PR includes tests, or tests are not required or relevant for this PR.
This PR has no interface changes (e.g., macros, CLI, logs, JSON artifacts, config files, adapter interface, etc.) or this PR has already received feedback and approval from Product or DX.
This PR includes type annotations for new and modified functions.

The `BaseRunner` class is widely inherited, and it's untyped. Sometimes the wrong thing gets passed into the classes that inherit from `BaseRunner`. The subclasses of `BaseRunner` generally further restrict some of the available typing (e.g., the type of node that can be passed in). In order to properly type those subclasses, we should first type `BaseRunner`. This is the start of that.

As part of this I had to switch the typing of node from `GraphMemberNode` to `ManifestNode`. We needed to do this because `GraphMemberNode` was _too_ wide. Specifically, `GraphMemberNode` includes the `Metric` node, which doesn't inherit from `NodeInfoMixin` and thus does not have an `update_event_status` method. Unfortunately, moving to `ManifestNode` _excludes_ some nodes from the allowed typing that we'll probably find later on we need to include. Maybe that means some nodes should be moved to the `ManifestNode` definition. However, we might find that it would be more appropriate to add an `ExecutableNode` or `RunnableNode` protocol definition, and then use that for typing.
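The protocol idea mentioned above could look roughly like the following. This is only a sketch: `RunnableNode`, `FakeModelNode`, `FakeMetricNode`, and `mark_started` are hypothetical names for illustration, and the `update_event_status` signature shown here is an assumption, not the real one.

```python
from typing import Any, Dict, Protocol, runtime_checkable


@runtime_checkable
class RunnableNode(Protocol):
    """Hypothetical protocol: any node a runner can report event status on."""

    def update_event_status(self, **kwargs: Any) -> None: ...


class FakeModelNode:
    """Stand-in for a node that has the event-status behaviour."""

    def __init__(self) -> None:
        self.event_status: Dict[str, Any] = {}

    def update_event_status(self, **kwargs: Any) -> None:
        self.event_status.update(kwargs)


class FakeMetricNode:
    """Stand-in for a node without `update_event_status`."""


def mark_started(node: RunnableNode) -> None:
    # A type checker accepts FakeModelNode here and rejects FakeMetricNode,
    # without either class having to inherit from a shared base.
    node.update_event_status(node_status="started")
```

With a protocol, the typing is driven by what the runner actually calls on the node rather than by which union the node happens to belong to.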

…_result` of `BaseRunner`

There are two type errors in this commit. The first is that the type hinting of `RunResult.node` and the `node` argument of `_build_run_result` are _different_! I tried to fix this in a previous commit by going from `GraphMemberNode` to `ManifestNode`, as `GraphMemberNode` was too broad. Unfortunately, it seems `ManifestNode` is too narrow. Checking the typing of `RunResult.node`, which I should have done sooner, it uses `ResultNode`, which makes sense. The second type error is that the type hinting of `RunResult.status` and the `status` argument of `_build_run_result` are _different_. The correct typing should be that of `RunResult.status`. Both of these typing issues will be resolved in the next commit.

… in `BaseRunner`

We did this because in `BaseRunner.compile_and_execute` we weren't getting type completion for accessing `ctx.node.node_info`. Said another way, we didn't have the type context in `compile_and_execute` to know if a `node_info` property existed on the `node` of the `ctx`. By improving type hinting for `ExecutionContext` we were able to resolve this.
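As a rough illustration of why annotating the context helps, here is a minimal sketch. `FakeNode` and `SketchExecutionContext` are hypothetical stand-ins, not the real `ExecutionContext`; the shape of `node_info` is also an assumption.

```python
from dataclasses import dataclass
from typing import Any, Dict


@dataclass
class FakeNode:
    """Stand-in node exposing a `node_info` property."""

    name: str

    @property
    def node_info(self) -> Dict[str, Any]:
        return {"node_name": self.name}


class SketchExecutionContext:
    """Hypothetical context whose `node` attribute carries an explicit type."""

    def __init__(self, node: FakeNode) -> None:
        # Because `node` is annotated, a checker (and editor completion)
        # can see that `ctx.node.node_info` exists.
        self.node: FakeNode = node


def describe(ctx: SketchExecutionContext) -> str:
    return ctx.node.node_info["node_name"]
```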

Because `execute` didn't have its return type specified, it appeared to `run` that there was no expected return value. In turn, `compile_and_run` was saying that the `result` it would return would _always_ be `None`. However, that is most assuredly _not_ the case, and typing `execute` solves this.
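A minimal sketch of the idea, assuming a simplified runner: `FakeRunResult`, `ReturnTypeSketchRunner`, and `EchoRunner` are hypothetical names, and the method signatures are deliberately simpler than the real ones.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass


@dataclass
class FakeRunResult:
    """Stand-in for the real result type (an assumption for this sketch)."""

    status: str


class ReturnTypeSketchRunner(ABC):
    @abstractmethod
    def execute(self, node: str) -> FakeRunResult:
        """Subclasses do the real work and must return a result."""

    def run(self, node: str) -> FakeRunResult:
        # With `execute` annotated, callers of `run` see a FakeRunResult
        # instead of an inferred "returns nothing".
        return self.execute(node)


class EchoRunner(ReturnTypeSketchRunner):
    def execute(self, node: str) -> FakeRunResult:
        return FakeRunResult(status=f"ran {node}")
```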

Specifically, the methods:

- `_handle_catchable_exception`
- `_handle_internal_exception`
- `_handle_generic_exception`
- `handle_exception`

Additionally, we are now defining "catchable errors" twice: once as a union of types in `_handle_catchable_exception` and once as a tuple of classes in `handle_exception`. We should look at refactoring this into a single definition sometime, as a change to one should correspond 1-1 to a change in the other.
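One possible way to keep a single definition is to declare the union once and derive the tuple from it with `typing.get_args`. This is a sketch, not the refactor the PR proposes: the exception classes and function names below are hypothetical stand-ins.

```python
from typing import Union, get_args


class FakeCompilationError(Exception):
    """Stand-in for one catchable error type."""


class FakeRuntimeError(Exception):
    """Stand-in for another catchable error type."""


# Single source of truth: the union used in type hints.
CatchableError = Union[FakeCompilationError, FakeRuntimeError]

# Derived tuple for isinstance checks, so the two cannot drift apart.
CATCHABLE_ERRORS = get_args(CatchableError)


def handle_catchable(e: CatchableError) -> str:
    return f"handled {type(e).__name__}"


def handle_exception(e: Exception) -> str:
    # Note: a static checker may not narrow `e` through the derived tuple,
    # so real code might need a cast or an explicit tuple here.
    if isinstance(e, CATCHABLE_ERRORS):
        return handle_catchable(e)
    return "unhandled"
```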

…eRunner`

These two methods are essentially just additional logging for the runner. They should never return anything. Typing them as such helps guarantee that, so people don't end up doing weird things with them.

In adding type hinting to `on_skip` and `do_skip`, a potential runtime error was uncovered: we were accessing `self.skip_cause.status` in `on_skip` even though `skip_cause` can be `None`. In that edge case, a runtime error would be raised given how we were attempting to access the `status` property. We now handle this edge case, and we only discovered it because of the additional typing.
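The shape of this kind of bug, and of a guard against it, can be sketched as follows. `FakeSkipCause`, `SkipSketchRunner`, and `describe_skip` are hypothetical; the actual fix in the PR may differ.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class FakeSkipCause:
    """Stand-in for the result that caused a skip."""

    status: str


class SkipSketchRunner:
    def __init__(self) -> None:
        # Optional: a skip may happen before any cause has been recorded.
        self.skip_cause: Optional[FakeSkipCause] = None

    def describe_skip(self) -> str:
        # Reading `self.skip_cause.status` unconditionally would raise
        # AttributeError when `skip_cause` is None; a type checker flags
        # that access once the attribute is annotated as Optional.
        if self.skip_cause is None:
            return "skipped (cause unknown)"
        return f"skipped (upstream status: {self.skip_cause.status})"
```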

codecov · 2024-08-26T19:37:59Z

Codecov Report

Attention: Patch coverage is 92.50000% with 3 lines in your changes missing coverage. Please review.

Project coverage is 88.86%. Comparing base (bba020f) to head (f500372).
Report is 112 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main   #10607      +/-   ##
==========================================
- Coverage   88.91%   88.86%   -0.05%     
==========================================
  Files         180      180              
  Lines       22755    22767      +12     
==========================================
+ Hits        20232    20233       +1     
- Misses       2523     2534      +11

| Flag | Coverage Δ | |
| --- | --- | --- |
| integration | `86.11% <92.50%> (-0.11%)` | ⬇️ |
| unit | `62.34% <90.00%> (-0.01%)` | ⬇️ |

Flags with carried forward coverage won't be shown.

| Components | Coverage Δ | |
| --- | --- | --- |
| Unit Tests | `62.34% <90.00%> (-0.01%)` | ⬇️ |
| Integration Tests | `86.11% <92.50%> (-0.11%)` | ⬇️ |

.changes/unreleased/Under the Hood-20240826-141843.yaml

Co-authored-by: Emily Rockman <emily.rockman@dbtlabs.com>

QMalcolm · 2024-11-08T22:46:50Z

The `BaseRunner` and its derivatives unfortunately need to be refactored before we can properly type them 😞

QMalcolm added 3 commits August 23, 2024 10:58

Add typing to BuildResult.get_result_status

d678d73

QMalcolm requested a review from a team as a code owner August 26, 2024 19:20

cla-bot bot added the cla:yes label Aug 26, 2024

QMalcolm force-pushed the qmalcolm--typing-base-runner branch from 11abfa8 to fb6c1ba Compare August 26, 2024 19:21

QMalcolm added the artifact_minor_upgrade To bypass the CI check by confirming that the change is not breaking label Aug 26, 2024

QMalcolm added 13 commits August 26, 2024 14:33

Add type hinting for BaseRunner._build_run_result

2092a70

Fix typing of node in BaseRunner init and other methods

f4dd99b

Add ResultStatus type union definition and use to fix status typing…

15ef78b

… in `BaseRunner`

Add type hinting to BaseRunner.compile_and_execute

b909090

Add type hinting to safe_run and _safe_release_connection of `Bas…

3bd2972

…eRunner`

Add type hinting to before_execute and after_execute of BaseRunner

d274046

Add type hinting to BaseRunner._skip_caused_by_ephemeral_failure

c3512ed

Add changie doc for BaseRunner type hinting improvements

3b2481b

QMalcolm force-pushed the qmalcolm--typing-base-runner branch from fb6c1ba to 3b2481b Compare August 26, 2024 19:34

emmyoop reviewed Aug 28, 2024

.changes/unreleased/Under the Hood-20240826-141843.yaml (outdated)

Update .changes/unreleased/Under the Hood-20240826-141843.yaml

f500372

Co-authored-by: Emily Rockman <emily.rockman@dbtlabs.com>

MichelleArk added the tidy_first "Tidy First" incremental cleanup changes label Sep 3, 2024

QMalcolm closed this Nov 8, 2024
